Efficient Advertisement Discovery for Audio Podcast Content Using Candidate Segmentation
نویسندگان
چکیده
Nowadays, audio podcasting has been widely used by many online sites such as newspapers, web portals, journals, and so forth, to deliver audio content to users through download or subscription. Within 1 to 30 minutes long of one podcast story, it is often that multiple audio advertisements (ads) are inserted into and repeated, with each of a length of 5 to 30 seconds, at different locations. Automatic detection of these attached ads is a challenging task due to the complexity of the search algorithms. Based on the knowledge of typical structures of podcast contents, this paper proposes a novel efficient advertisement discovery approach for large audio podcasting collections. The proposed approach offers a significant improvement on search speed with sufficient accuracy. The key to the acceleration comes from the advantages of candidate segmentation and sampling technique introduced to reduce both search areas and number of matching frames. The approach has been tested over a variety of podcast contents collected from MIT Technology Review, Scientific American, and Singapore Podcast websites. Experimental results show that the proposed algorithm archives detection rate of 97.5% with a significant computation saving as compared to existing state-of-the-art methods.
منابع مشابه
ZemPod: A semantic web approach to podcasting
In this paper we present a semantic web approach to solve some current limitations of podcasting. The main shortcomings of podcasts are two. The first one is that there is no formal description of the contents of a podcast session, apart from a textual description only available in HTML. The second problem is that a podcast session consists of a single audio file. Thus, it is very difficult to ...
متن کاملAudio-Video Based Segmentation and Classification using AANN
This paper presents a method to classify audio-video data into one of seven classes: advertisement, cartoon, news, movie, and songs. Automatic audio-video classification is very useful to audio-video indexing, content based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips...
متن کاملPodcast vs. Pamphlet; Which One Is More Effective in Self-care Training Program of Diabetic Patients?
Background and Purpose: Self-care can help patients with diabetes to reduce complications of the disease. The purpose of this study was to compare the effectiveness of diabetes self-care educational programs through tow podcast and pamphlet methods. Material and Methods: The present study was quasi-experimental research conducted in Tehran Aboozar Diabetes Center (2014). 90 patients wi...
متن کاملUsing Term Clouds to Represent Segment-Level Semantic Content of Podcasts
Spoken audio, like any time-continuous medium, is notoriously difficult to browse or skim without support of an interface providing semantically annotated jump points to signal the user where to listen in. Creation of time-aligned metadata by human annotators is prohibitively expensive, motivating the investigation of representations of segment-level semantic content based on transcripts genera...
متن کاملUnsupervised word discovery from speech using automatic segmentation into syllable-like units
This paper presents a syllable-based approach to unsupervised pattern discovery from speech. By first segmenting speech into syllable-like units, the system is able to limit potential word onsets and offsets to a finite number of candidate locations. These syllable tokens are then described using a set of features and clustered into a finite number of syllable classes. Finally, recurring syllab...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- EURASIP J. Audio, Speech and Music Processing
دوره 2010 شماره
صفحات -
تاریخ انتشار 2010